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METHOD AND APPARATUS FOR THE DIGITIZATION OF AND 
FOR THE DATA COMPRESSION OF ANALOG SIGNALS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

[0001 ] This application claims priority of International Application 

No. PCT/EP2004/008217, filed July 23, 2004 and German Application No. 103 34 357.1, 

filed July 25, 2003, the complete disclosures of which are hereby incorporated by reference. 

BACKGROUND OF THE INVENTION 

a) Field of the Invention 

[0002] The present invention is directed to a method and apparatus for the digitization of 
analog signals and a method and apparatus for data compression of analog signals. 

b) Description of the Related Art 

[0003] In many fields of digital information transmission there exists the common goal of 
transporting an initially analog source signal (measuring, studio-quality audio signal 
processing, and so on) to a receiver while making use of the advantages of digital 
transmission in order to reproduce this analog source signal in the receiver in the form of an 
analog output signal. For this purpose, the analog signal must be digitized and quantized on 
the one hand and a corresponding data compression must be carried out on the other hand in 
order to transmit the signal with the highest possible quality via a digital transmission system 
with a limited rate. After a PCM encoding, methods for data compression are usually applied 
which make use of the redundancy within the source signal and irrelevance with respect to 
specific characteristics of the consumer of the signal. The numerous existing methods for 
digitizing analog source signals use approaches that sometimes differ fundamentally, but all 
of them can be divided into two main categories: 

i) Methods in which the reconstructed waveform very closely approximates the 
original waveform, i.e., there is no use of irrelevance. The term "waveform-preserving" is 
used here in place of the often applied term "lossless" with respect to the waveform coding 
because the digitization of an analog signal at a limited data rate is, on principle, not 
"lossless" (i.e., infinite entropy of a continuous random variable). 



ii) Non-waveform-preserving methods. These methods are of crucial importance, 
for example, in the audio field, where often only the subjective aural impression at the 
receiver output is decisive (e.g., use of psychoacoustic masking effects). In this case, 
waveform changes (amplitude distortion and phase distortion) undergone by the signal 
through quantization and compression generally only play a subordinate role. Normally, a 
signal processing using irrelevance leads to a reconstructed waveform that differs greatly 
from the original waveform and, moreover, is often afflicted by extensive signal delay (e.g., 
because of spectral transformations or equivalent block-based methods). The transmission 
quality in a method of the type mentioned above cannot be measured by a 10 logio 
signal/noise ratio (SNR) in the conventional way; rather, it must be determined by time- 
consuming listening tests by trained personnel. However, signal coding methods of this type 
are totally unusable for many areas of application (e.g., metrology, recording of waveforms 
for further signal processing at a later time, real-time signal transmission using digital 
modulation methods that do not permit significant signal delay, e.g., for wireless digital stage 
microphones). 

OBJECT AND SUMMARY OF THE INVENTION 

[0004] Therefore, it is the primary object of the present invention to provide a method and 
apparatus for the digitization and compression of analog signals with improved quality. 

[0005] This object is met by a method for the digitization of analog signals comprising the 
steps of digitizing analog source signals, transforming the digitized source signals from the 
time domain to the spherical domain, wherein the transformation is a Z)-dimensional 
transformation with D>2 and logarithmic quantizing of the radius in the spherical domain. It 
is also met by an apparatus for the digitization of analog signals comprising means for the 
digitization of analog source signals, means for the transformation of the digitized source 
signals from the time domain to the spherical domain, wherein the transformation is a D- 
dimensional transformation with Z)>2, and means for the logarithmic quantization of the 
radius in the spherical domain. It is further met by a method for the compression of analog 
signals comprising the steps of digitizing analog source signals by the method described 
above and carrying out a differential pulse code modulation. It is also met by an apparatus 
for the compression of analog signals comprising the apparatus described above including an 
encoder for differential pulse code modulation and a forward prediction device for 



determining a starting value for the samples of the quantization based on the current state of a 
prediction filter, a reconstruction device for reconstructing the subsequent D samples, a 
£>-dimensional logarithmic spherical quantization device for quantizing the values obtained 
by the forward prediction in order to determine a starting cell, wherein the prediction of the 
differential pulse code modulation is run through iteratively in order to determine a 
quantization cell with the smallest quantization error. 

[0006] Accordingly, a method is provided for the digitization of analog source signals in 
which a /^-dimensional spherical logarithmic quantization of the analog source signals is 
carried out. 

[0007] According to one aspect of the present invention, a method and an apparatus are 
likewise provided for the compression of analog source signals in which a digitization of 
analog source signals is carried out in accordance with the method described above, and 
wherein a differential pulse code modulation is carried out. 

[0008] Accordingly, a waveform-preserving method is provided which meets the 
following requirements: a) low data rate through the use of favorable packing characteristics 
of multi-dimensional lattices (vector quantization) and through the use of dependencies in the 
sequence of samples from the source signal; b) extremely high dynamic range, i.e., the SNR 
is constant over a very large modulation range of, e.g., 60 dB or more with respect to the 
short-term variance of an analog source signal; c) high objectively measurable signal/noise 
ratio by segments for short segments of samples, preferably within the meaning of the ratio of 
variance of the useful signal to the mean square error; d) insensitivity to specific signal 
parameters such as probability density function by segment, etc.; and, most importantly, e) 
introduction of an extremely small signal delay on the order of a few (up to ten) sampling 
periods. 

[0009] The invention is based on the idea of providing a method for high-resolution, 
waveform-preserving digitization of analog signals, wherein conventional scalar logarithmic 
quantization is transferred to multi-dimensional spherical coordinates, and the advantages 
resulting from this, e.g., a constant signal/noise ratio over an extremely high dynamic range 
with very low loss with respect to the rate-distortion theory. In order to make use of the 
statistical dependencies present in the source signal for an additional gain in the signal/noise 
ratio, the differential pulse code modulation (DPCM) is combined with spherical logarithmic 



quantization. The resulting method achieves an effective data reduction with a high long- 
term and short-term signal/noise ratio with an extremely small signal delay. 

[0010] Further embodiments of the invention are indicated in the subclaims. 

[001 1] In the following, the invention will be described more fully with reference to the 
accompanying drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0012] In the drawings: 

[0013] Figure 1 shows the proportionality of an arc segment to the radius; 

[0014] Figure 2 is a graph illustrating the quantity of quantization cells associated with a 
radius, normalized to the cell quantity per absolute value of a sample in scalar quantization; 

[0015] Figure 3 is a graph illustrating a signal/noise ratio loss at different values of A; 

[0016] Figure 4 is a graph illustrating the signal/noise ratio according to the present 
invention; 

[0017] Figure 5 shows a block diagram of a DPCM encoder with backward prediction 
according to the present invention; 

[0018] Figure 6 is a graph illustrating the signal/noise ratio according to another 
embodiment example of the invention; and 

[0019] Figure 7 shows graphs of the signal level and of the signal/noise ratio according to 
another embodiment example of the invention. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0020] The aim of the logarithmic quantization is a high dynamic of the quantizer, that is, 
a wide range of the average signal level in which the SNR and, therefore, the maximum 
relative quantization error 

n 

is constant, wherein A#, is the width of the ith quantization interval and r, is the respective 
reconstruction value. This aim leads to logarithmic quantization, e.g., within the meaning of 
the^-law according to K. Trondle, R. WeiB, Introduction to Pulse Code Modulation, 



Oldenbourg Verlag, Munich, 1974. For the range of average modulation, the following 
signal/noise ratio is given for R » 1 according to N. S. Jayant, P. Noll, Digital Coding of 
Waveforms, Prentice-Hall, Englewood Cliffs, New Jersey, 1984: 

10 log 10 (SNR) = R - 6.02dB + 101og 10 (1) 

(l + ln(^)) 2 

where R is the average rate of quantization (bits/sample) and A represents the usual 
parameters of logarithmic quantization; that is, the maximum relative error is constant for 
samples with an amount greater than l/A (with reference to a signal range of the quantization 
of -1 to +1). Therefore, a signal/noise ratio according to (1) is achieved after an average 
signal level of the source signal of 

-5, := 201og 10 (lM)dB. (2) 

Accordingly, the value Bi characterizes the dynamic range of the logarithmic quantization. 
Also, the signal/noise ratio is entirely independent from the probability density function 
(PDF) of the source signal within this dynamic range so that the method can be used 
universally. However, in comparison with a uniform quantization at full drive by an 

equidistributed signal, the term 1 0 log 10 ^3/(1 + In (A)) 2 j to a certain extent represents a 

signal/noise ratio loss by companding, which is the price of a large dynamic range. 

[0021] A quantization in spherical coordinates in D dimensions will be described in the 
following. 

[0022] Spherical logarithmic quantization belongs to the family of vector quantization 
methods (e.g., R. M. Gray, David L. Neuhoff, Quantization, IEEE Transactions on 
Information Theory, pp. 2325-2383, Oct. 1998, Manfred Herbert, Lattice Quantization of 
Speech Signals and Speech Model Signals. Selected Works on Information Systems, No. 79, 
editor: H.-W. SchuJMer, Erlangen 1991). By means of dense spherical packing in many 
dimensions, considerable gains in signal/noise ratios can be achieved in the quantization even 
when no statistical dependencies within the signals can be made use of. In spherical 
quantization, a vector x := (x x ,... 9 x D ) with D samples in Cartesian coordinates is expressed in 

polar coordinates u := (cp, ,...,<p D __j ,r). The D - 1 angle <p\ and the radius r are given by the 



following equations ( where j is an imaginary unit and arg (•) is an argument function which 
indicates the angle of a complex number in radian measure): 

<p, =arg(x, +jx 2 )e[-TC,+7t) (3) 



( 



Z*/ 2 +/*,+! 



\ _ 
e 



2 2 J' 

ie {2,. ..,£>-!}. 



(4) 



<- 2 =I>, 2 

/=1 



(5) 



[0023] The reconstruction of the Cartesian components from vector u is carried out from 
this according to 



x t : = r ■ & f _, • sin (<p,_, ) , i e { D, 2} 



(6) 



x, =r-Vcos((p,) = r-6 0 , (7) 
with the radii 6, of the "parallel of latitude" of the unit sphere (radius 1): 

1> (8) 

D-\ 

^ =1- ]7I cosCcp;), /e{£>-2 5 ...,0} (9) 

[0024] Accordingly, an economical recursive representation is found proceeding from 
<Pd-\- An efficient implementation of forward transformation and backward transformation is 
made possible with little complexity by means of the so-called CORDIC algorithm (described 
in Voider, Jack, The CORDIC Trigonometric Technique, IRE Trans. Electronic Computing, 
Vol. EC-8, pp. 330-334, Sept. 1959). 

[0025] Spherical logarithmic quantization is described in the following. 

[0026] In order to continue to make use of the advantages of logarithmic quantization, i.e., 
the independence of the SNR from the PDF of the source signal, with quantization in polar 
coordinates, the amount (radius) is quantized logarithmically according to the rules of the A- 
law. The angles <p t are uniformly quantized individually, in each instance with the 
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quantization intervals being selected in each instance depending on the higher-order angles 



the reconstruction. In this way, a very simple implementation of the quantization and the 
signal reconstruction is achieved and a stepwise processing of the individual coordinates, as 
in scalar quantization, is preserved by the iterative procedure according to (3) to (9). 

[0027] Due to the proportionality of the length of an arc segment to the radius, the 
requirements for the logarithmic quantization, i.e., the proportionality of the size of a 
quantization interval to the signal value, are already met for uniform quantization of the 
angles. Fig. 1 shows a simple example for two dimensions D = 2. Therefore, the second 
term in (1) must be counted for only one of the total of D dimensions; herein lies the essential 
reason for the gains achieved by means of spherical logarithmic quantization. 

[0028] In order to facilitate the implementation of quantization in spherical coordinates, 
we start from approximately cubic quantization cells, i.e., the surface of a D-dimensional 
sphere with radius 1 is quantized by a network of D - 1 dimensional (hyper-)cubes. As will 
be shown in the following, however, this suboptimal quantization of the spherical surface 
compared to an optimally densely packed lattice results in a loss of only 



in the limit D — > oo , that is, a rate loss of about 1/4 bit/sample, which is accepted for the sake 
of facilitated implementation. 

[0029] The following description explains how the M° (i.e., M := 2 R ) quantization stages 
available per quantization step are optimally divided into the individual quantization intervals 
for the radius and the surface of the £>-dimensional unit (hyper-)sphere. 

[0030] As is conventional, the applicable compressor characteristic for the logarithmic 
range 



of the ^4-law rule is 



(p/ , / g {/ + 1, / + 2, D - 1} that have already been quantized and which are also used for 





(10) 



This is derived by 

r ( r)3 = ^. (11) 
dr r 

[0031] A refers to the free parameter of the AAaw and r 0 is introduced as a normalization 
factor for the radius which will be described more fully in the following. With Mo 
quantization intervals for the radius CDth component of vector w), the width of the 
quantization cell in direction of the radius is given by: 

Ar(r)« ^ = — r. (12) 

M D k'(r) M D c ' 

It will be noted that Ar(r) is not dependent on r 0 in the domain in question. 

[0032] In order that a saturation behavior similar to that in Cartesian coordinates can be 
obtained with quantization in polar coordinates, the maximum value r 0 for the radius is 
normalized in such a way that the Z)-dimensional sphere has the same volume as a D- 
dimensional cube with edge length 2 (corresponding to a quantization range x i e [— l;l] in 

every dimension), i.e., V sphere = V cube : 

a D -r 0 D I 2 D , 

with the volume a D of the Z)-dimensional unit sphere according to J. H. Conway and N. J. A. 
Sloane, Sphere Packings, Lattices, and Groups, Springer-Verlag, 3rd edition. 

D/2 

a D = (13) 

° (D/2)! V ' 

Therefore: 

2 

r 0 where r 0 > 1 VZ) e IN. (14) 

[0033] To facilitate the determination of the resulting signal/noise ratio, r = 1 is used in 
the following, i.e., those quantization cells lying on the surface of a sphere with unit radius 
are considered. This is possible without limiting generality because the SNR is not 
dependent upon radius due to the logarithmic quantization in the domain: 
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7— »• 

The width of the quantization cells is now given by: 

A:= XT— ■ (15) 

M D -c 

[0034] By means of the uniform angular quantization mentioned above, the surface of the 
Z)-dimensional unit sphere is divided into M 9 cells having in each instance the form of D - 1- 
dimensional cubes. For the surface of this D - 1 -dimensional sphere, according to J. H. 
Conway and N. J. A. Sloane, Sphere Packings, Lattices, and Groups, Springer-Verlag, 3rd 
edition: 

S = $ D • r D ~ ] , where $ D =Da D . (16) 

[0035] The division of the quantizing stages into radius and angle is described the 
following. 

[0036] The following considerations must be taken into account for a fair division of the 
M D available quantization cells into radius and surface of the unit sphere: with respect to the 
approximation by cubic quantization cells, i.e., approximately equal dimensioning of the 
quantization cells in all dimensions (radius and arc segment), the surface of the unit sphere (r 
= 1) must be divided into M<p equal (D — l)-dimensional cubes with identical edge length A. 

S = P d Im^^ d ~\ (17) 
Since there are M=2 R available intervals per sample, 

M^-M D =M D (18) 
must be demanded in addition. 

[0037] With (15), (17) and (18), the quantity of the quantization intervals available for the 
quantization of the radius is given by: 

PD c 



- 10- 



[0038] The quotients MoKMIT) of the quantity of quantization cells dedicated to the radius 
are plotted in Fig. 2 normalized to the cell quantity per absolute value of a sample in scalar 
quantization (D = 1) compared to the dimension number for a series of examples of 
parameter A of A -law companding. The values of A are selected in such a way that the use of 
y4-law companding leads to very small quantization intervals which are smaller by a factor of 

2 An ,An e IN than they would be with uniform quantization with an identical total interval 
number: 



Ar(r <l/A) = 2" A " , i.e., 



Mi 



2 An = 



(20) 
(21) 



1 + lnA 

Consequently, the resolution is increased by An bits for small values of r. 

[0039] Fig. 2 shows that spherical logarithmic quantization beyond a factor of 8 allocates 
more intervals to the radius than a scalar quantization with an equivalent resolution in the 
range of very small signal values. Therefore, a gain of 3 bits/sample or 18 dB for spherical 
logarithmic quantization is achieved in this range. 

[0040] The edge length of the quantization cells on the surface of the unit sphere is 
obtained using (15): 

l 

(22) 



M \ c ) 



The constant c is determined by the selected dynamic range, see (2) and (10). 

[0041] The actual uniform quantization of the angle variables for identical arc segments 
on the unit sphere is obtained by means of the recursive equations (8) and (9). A/, intervals 
are available for the coordinate #>,: 



71 

A 



(23) 



7i bj 



(24) 



for e={i)-2,Z>-3,...,2} 



2ti -6, 
A 



(25) 



where \_xj e IN is the greatest whole number < x with x e //? + and 6, corresponding to (8) 



and (9) for quantized angles cp, . 

[0042] It should be noted that the quantity M, of quantization intervals in dimension i is a 
function of the quantization cell selected in dimensions i + 1 ,...,£> - 1 and, therefore, can be 
calculated iteratively starting from Md-\. It is not possible to calculate in advance. 



[0043] The assignment of an index N e <0,1,...,M -l[ to the current quantization cell 



and the reconstruction can be carried out using linked lookup tables as is known from shell" 
mapping, e.g., according to Robert F. H. Fischer, Precoding and Signal Shaping for Digital 
Transmission, pp. 258-281, John Wiley & Sons, Inc., New York, 2002, ISBN 0471 22410 3: 

[0044] The assignment of an index to the radius is not dependent upon the cell index on 
the unit sphere so that the problem of index assignment for the surface of the unit sphere can 
now be considered. 

[0045] The starting number of the index is 0, the maximum ending number of the index is 
My, - 1 corresponding to the quantity of quantization cells on the surface of the unit sphere. 
Md-\ is the quantity of subspheres of dimensionality/) - 1 and can be calculated from (23). 
A first-order lookup table with Md-\ entries N 0 ,... 9 N Md is used for the sake of fast 

implementation. In this connection, TV, designates the smallest index of all cells belonging to 
the ith quantization interval for the angle cp D _ } : 



N 0 =0 



(26) 




/€{0,...,(M D . l -2)} 
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[0046] Accordingly, there are N,-+i - N, cells for a fixed rth value of <p D _| . For every N h a 
second-order lookup table is applied which contains the indices Oy which again designate the 
smallest index of all of the cells of the jth quantization interval for the angle (pjy-2 within the 
D - 1 -dimensional subsphere to which N, refers. 

O i)0 = 0 (27) 



/g{0 ? ...,(M d _ 2 -1)} ? 



0,..., 



M D-2 



A 



[0047] This procedure leads to D - 2 orders of linked lookup tables in order to indicate the 
D - 1 angles (the D — 1 -order lookup table for angle cpj would contain consecutive cell 
indices and must therefore not be tabularized). 

[0048] The quantization noise and the signal/noise ratio are considered at greater length in 
the following. 

[0049] As long as there is a sufficiently large quantity of quantization cells in D 
dimensions, the usual approximation of an equidistributed quantization error can be applied 
within these cubic cells, each cell being represented by its center. The output of the 
quantization noise is given approximately by: 

2 

— D. (28) 
12 

[0050] The spherical logarithmic quantization forces A(r) = A • r in all dimensions for 

— < r < r n . Therefore. 
A 0 
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2 

SNR = - = F(D)M 2 , (29) 

a 2 2 D 

A • r 

12 



2 

where F(Z)) := — • 
D 



C \D 

(30) 



see also (22). As is wanted, the SNR is independent from the variance of the signal in this 
range. 

[0051] Inserting (13) and (16) gives: 

nD)^^({DI2)f D ^ D . (31) 
D D 

[0052] When (29) is considered, F(D) can be interpreted as a loss with respect to the rate- 
distortion bound for iid Gaussian random variables (6 dB-per-bit rule). This relationship is 
shown in Fig. 3 for different values of A. 

[0053] Stirling's approximation 

' x A 



x!« y/2nx 

is used to calculate the limit 

6 A 

lim F(D) = — = -1.53 dB (32) 
D-+ao 7i e 

representing the loss in relation to the rate-distortion bound which is caused by the 
suboptimal cubic quantization cells (relative to (D - l)-dimensional hyperspheres for the 
quantization of the surface of a Z)-dimensional unit sphere and D — > <x> ). In other words, 
spherical logarithmic quantization makes it possible to compensate again for the loss 
described in (1) resulting from companding to only 1.53 dB. Therefore, through the selection 
of the parameter A and Z), it is possible in theory to achieve a dynamic range of any size 
(range of constant signal/noise ratio over the average signal level) without having to accept 
significant losses in the maximum signal/noise ratio that can be achieved. The asymptotic 
SNR loss of 1.53 dB corresponds to a rate loss of 1/4 bit/sample which is acceptable for the 
sake of an extreme reduction in complexity. 
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[0054] In view of the fact that no knowledge of a probability density function for iid 
source signal values can be deliberately assumed for the purpose of logarithmic companding, 
the rate-distortion function for the Gaussian random variables again gives the lower bound 
for the achievable rate (or distortion) at a given distortion (or rate) (Berger's upper bound of 
the rate-distortion function according to T. Berger, Lossy Source Coding, IEEE Transactions 
on Information Theory, pp. 2693-2723, Oct. 1998). Therefore, improvements going beyond 
1.53 dB or 1/4 bit/sample are not possible anyway under the present requirements and 
restrictions. 

[0055] Fig. 4 shows, by way of example, the distance of the signal/noise ratio from the 
rate-distortion bound R • 6dB depending in the average signal level (10 loglO (variance)) of 
iid random variables in Gaussian distribution when A = 48270 for different dimension 
numbers of spherical logarithmic quantization. These simulation results exactly satisfy the 
theoretical analysis according to Fig. 3. At first glance, the very large values for A that are 
used in the examples in Figs. 3 and 4 and in the following examples seem unrealistic and 
impossible to implement. However, it should be taken into account that substantially more 
than M intervals are usually used by means of (19) for the quantization of the radius (M D > 
Mil) (see Fig. 2). Accordingly, even for very low rates, e.g., R = 4 bits/sample, there are very 
fine quantization intervals for the radius. Further, (10) is an invertible function for every 
value of A > 1 and is therefore very well suited in every case for specifying a determined non- 
uniform quantizer. This approach for proper waveform coding with extremely low signal 
delay should preferably not be used for rates below 3 bits/sample. 

[0056] Further, it is clear from Fig. 4 that the dynamic range rises sharply through two 
effects as the dimension number increases with a constant parameter A, For one, the 
normalization radius ro increases; for another, the limiting of the logarithmic compression 
acts on only one dimension, the radius, whereas for D - 1 dimensions the proportionality of 
the expansion of the quantization cell to the signal value (= arc length to radius with fixed 
angle difference) is also retained for very small signal values. Therefore, the dynamic range 
in D dimensions expands approximately to 



B D « B x +201og 10 (r 0 ) + 101og 10 (D). 



(33) 
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[0057] In addition, because of the averaging effect within D values, the saturation strength 
increases so that a further increase in the dynamic range is achieved. There is an infinitely 
large dynamic range for each desired value of A in limit (D -» oo) . 

[0058] As in every vector quantization method, the delay of spherical logarithmic 
quantization by nature of its structure is equal to exactly D samples. As can be seen from 
Figs. 3 and 4, the majority of the possible gains is already achieved at very small values of D 
(up to 5). 

[0059] The combination of a spherical quantization and DPCM is described in the 
following. 

[0060] Correlations between the samples q[k] are made use of efficiently by means of 
differential PCM (DPCM) (see Fig. 5). A prediction error signal x[k] is generated by 
subtracting predicted samples that are obtained by means of a linear predictor filter 

H p (Z) • z~~ ] from reconstructed samples q[k]. For an ideal predictor, the prediction error 

sequence x[k] shows a white power density spectrum (PSD) and minimal variance. This 
predictor filter is usually adaptive in order to adapt to a nonstationary source. With the aim 
of describing the interaction with spherical logarithmic quantization as simply as possible, 
only examples for a fixed predictor filter will be presented in the present case. In audio 
signals, for example, even very short, fixed prediction error filters (designed with respect to a 
compromise criterion) usually offer gains of more than 1 8 dB with a sample frequency of 
44.1 kHz. Even segmentwise gains averaged over 6000 samples (0.136 s) of less than 15 dB 
can be observed only very rarely (see Fig. 7 compared to Fig. 3). Further, the gains are 
limited by adaptive prediction in case the signal delay is limited to a few samples and 
noticeable effects which are produced by updating the coefficients must be prevented. Of 
course, the following material can be generalized so as to apply directly to adaptive 
prediction. 

[0061] It should be noted that a gain in SNR expressed by a reduction in the mean square 
error between the original samples and the reconstructed samples is possible when using the 
so-called backward prediction shown in Fig. 5. 

[0062] With logarithmic quantization, the SNR of the prediction error signal xfkj is not 
dependent on its variance or, to express this differently, the output of the quantization noise is 
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proportional to the signal output. Accordingly, the prediction gain, i.e., the quotient of the 
variances of q[k] and x[k] can be converted directly to a gain in SNR. Therefore, logarithmic 
quantization is a good choice for DPCM. In addition to this, no further signal delay inherent 
to the system is carried out by DPCM compared with PCM because an optimal prediction 
error filter for maximum prediction gain is fundamentally causal and strictly minimum phase 
(see L. Pakula and S. Kay, Simple Proofs of the Minimum Phase Property of the Prediction 
Error Filter, IEEE Transactions on ASSP, Vol. 31, 1983) and can therefore be inverted 
without structure-induced delay. 

[0063] The gradient descent method will be described more fully in the following. 

[0064] Application of spherical logarithmic quantization to DPCM with backward 
prediction is subject to the same set of problems as any other vector quantization method: in 
order to calculate the current prediction error signal x[k], all of the preceding reconstructed 
samples q[k — /], i = 1,2,... must be present. For a high prediction gain, the immediately 
preceding values {i = 1, i = 2) in particular are indispensable. Unfortunately, this requirement 
cannot be reconciled with the quantization of blocks of length D samples. 

[0065] Any of the methods mentioned in the literature for combining vector quantization 
with DPCM can be used to solve this problem. In the present case, a method relying on the 
principle of analysis by synthesis is used. This method is known, for example from CELP 
waveform coding methods according to N. S. Jayant, P. Noll, Digital Coding of Waveforms, 
Prentice-Hall, Englewood Cliffs, New Jersey, 1984, and this approach is combined with a 
discrete gradient descent method. 

[0066] In order to resolve the conflict between DPCM and vector quantization, the square 
Euclidean distance between vectors of samples 

q[l] = (q[l -DUV D + ll^qU D + D-l]) (34) 

and a corresponding reconstruction vector q[l] is minimized / = L^/^J ■ 

[0067] It should be noted at this point that, in addition to spherical logarithmic 
quantization, the calculation of the corresponding prediction error signal and the inversion of 
the prediction error filter must be included in the calculation of a pair q 9 q , wherein 
preceding reconstruction vectors q[l-ni],m = 1,2,... are resorted to, but themselves remain 
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unchanged. The aim of the algorithm is to find those quantization cells for x[l] for which the 
metric 

d 1 (q, q) = X i<AP l + i]-q[D l + i]) 2 (35) 
i=0 

is minimized. 

[0068] In order to find a suitable starting value for the algorithm, we suggest beginning 
with a forward prediction for the current D samples corresponding to a deactivation of the 
chain from ADC and DAC in Fig. 5, or inputting q[k], k = \- DJ - D + \ 9 ...J - D + D - \ directly 
into the predictor filter (in Fig. 5) instead of q[k] . Spherical logarithmic quantization is now 
carried out on the resulting vector x and a starting vector y is generated which in turn gives a 
vector q by conventional inversion of the prediction error filter (DPCM receiver structure). 
In this way, the metric calculation for a given quantized vector >y can be carried out. Starting 
from an actual reconstruction vector^, the reconstruction vectors of all 2D nearest neighbors 
yjfij in D dimensions are determined and the associated metrics are calculated with (35) and 
compared to one another. The vector with the smallest metric is used for the next iteration, 

i.e., argmin y d (q,qjy^) generates an updating of /, wherein ^ corresponds to the quantized 

vector y { and is calculated by inverting the prediction error filter. If no such vector exists, the 
algorithm is terminated and gives the index of the vector >>, to be transferred. It should be 
noted here that the neighbor cells usually have very different indices and it is not a trivial 
matter to identify these cells. When we consider D ~ 3, for example, the angles of the 
azimuth {(pi) are quantized more crudely at large elevations (q>2 near ± 7i/2) than at small 
elevations (q>2 near 0) because the latter have a greater parallel of latitude on the sphere 
surface. For fast implementation, the indices of the neighbor cells could be stored in a ROM 
for all cells provided M 9 is not too large. 

[0069] Since linear prediction is included in the optimizing process, the resulting SNR is 
often greater than can be predicted by adding the gains (in dB) from spherical logarithmic 
quantization and prediction (DPCM). Noteworthy gains in SNR can be observed particularly 
in case of small rates (e.g., R < 5 bits/sample), which will be described in the following and 
with reference to Fig. 6. The Author is not currently aware of an analytic result for the SNR 
that can be achieved through this algorithm. 
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[0070] Simulations show that the required number of iterations is on average 
approximately 0.25 (per D sample) and, when limited to a maximum of 3 iterations, no 
significant losses are observed compared to an unlimited search space, so that the search for 
the optimal quantization cell for low dimension numbers can certainly be implemented in real 
time. Further, small values of D already yield large gains (see Figs. 3 and 6). 

[0071] On the reception side, there is no change compared with conventional DPCM 
methods. It should be noted here that the total delay of the transmission system is only D 
samples and it is excellently suited to transmissions which must meet the requirements of an 
extremely short delay. 

[0072] Variants of the method are described more fully in the following. 

[0073] To speed up the search for the most favorable quantization leading to minimum 
distortion, e.g., according to (35), all algorithms of the lattice decoding are applicable in 
principle or for finding a maximum likelihood codeword in channel coding (see, for example, 
Erik Agrell, Thomas Eriksson, Alexander Vardy, Kenneth Zeger, Closest Point Search in 
Lattices, IEEE Transactions on Information Theory,, pp. 2201-2214, Aug. 2002, and the 
references cited therein). This method is preferably transformed into spherical coordinates. 

[0074] A variant without iterative determination of the quantization cell is provided in that 
a non-uniform quantization of the angle coordinates is carried out instead of taking into 
account statistical dependencies within the D signal values which are actually to be quantized 
by means of a linear prediction filter according to Fig. 5 (DPCM). In this connection, the 
linear statistical links of the D signal values which are actually to be quantized to preceding 
signal values are made use of by means of conventional DPCM with backward prediction. 
That is, there is a backward prediction in which an updating of the prediction filter by D steps 
is carried out after D signal values. 

[0075] It is suggested that the logarithmic quantization of the amount is maintained so that 
the signal points can be normalized subsequently in such a way that they lie on a sphere with 
radius 1. This allows the analytic calculation of the probability density function of the signal 
points on the spherical surface, e.g., assuming a Gaussian signal process from the 
autocorrelation function of the source signal or a direct empirical determination of relative 
occurrences of the signal points on the spherical surface. 
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[0076] A non-uniform cubic quantization of the surface of a sphere with radius 1 can be 
determined in this way for M» 1, e.g., by means of the optimization formula according to 
(9). This formula states that, on the average, the amount of every quantization should be 
equal to the quantization noise. When A/ is the edge length of a (D - l)-dimensional cubic 
quantization cell and z, is the associated center point, that is, the reception-side reconstruction 
vector with amount 1 , and / (v); v := (q> x , ...,q> D _j ) is the probability density function or 

relative occurrence of signal values on the spherical surface, the following equation should be 
approximately satisfied: 

(D-D^-ZCzJAf" 1 = const. (36) 

with the secondary condition ]T A, = f3 D . (37) 

/=l 

[0077] This leads directly to a (D — l)-dimensional compressor function k(v) by which the 
non-uniform quantization of the sphere surface is uniquely determined. The non-uniform 
quantization can then be completed, for example, as in the one-dimensional case, by a 
nonlinear deformation of vector v to a vector z :~ k(v), subsequent uniform quantization to 

vector Zi according to III-C, and application of the inverse function v := k~ l (z, ) subsequently 

for obtaining the reconstruction vector. 

[0078] The resulting (D — 1 )-dimensional compressor function should preferably be 
approximated by an analytically describable function or, analogous to the usual 
representation in one-dimensional compression by means of straight-line segments (e.g., 13- 
segment characteristic line, see [5]), by (D - 2)-dimensional partial planes (with constant 
partial derivations). When this approximation is latticed orthogonally over the space 




over the D - 1 angles, the compressor function can be evaluated in a simple manner. 

[0079] To simulate spherical logarithmic quantization in combination with DPCM, the 
quantization of the overture and the aria "Der Vogelfanger bin ich ja" from the opera 
"Zauberflote" by Wolfgang Amadeus Mozart was examined (Philips Classics Productions 
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1994 (DDD), Mozart, "Der Vogelfanger bin ich ja" (Die Zauberflote), Polygram Records 
#442569-2, track 3). 

[0080] Fig. 6 shows simulation results for R = 3, R = 4 and R = 7 bits/sample when ^4 = 
102726, ^ = 48270, A = 4858 and with different dimension numbers D. 

[0081] The signal was first coded, then decoded and the SNR was calculated by 
comparison with the original CD signal. The signal/noise ratio is averaged over the entire 
piece of music. At the low rates, the SNR rises more steeply with D compared to Fig. 3. The 
reason for this is an improved averaging. The different values of parameter A must be taken 
into consideration when comparing the results. The aria "Vogelfanger" is stored on audio CD 
with an average signal level of -32.15 dB and offers an excellent example for audio signal 
coding because of the signal dynamics and tone colors (prelude, singing, panpipe). These 
simulation results can be regarded as representative for a large number of audio signals 
because a universal predictor filter of low order (P = 2) was used. 

[0082] Fig. 6 shows the measured signal/noise ratios averaged over the entire piece of 
music. It is apparent from a comparison of Fig. 3 (or (1)) and Fig. 6, where D = 1, R = 7, that 
the average prediction gain for this simple predictor is about 20 dB to 23 dB (sample 
frequency: 44.1 kHz!). When R = 7, the total gain in all values of D can be favorably 
approximated by the sum of the two individual gains from spherical logarithmic quantization 
and DPCM; when R = 4, this applies only for D > 5. (Of course, (1) is not applicable, for 
example, for R = 3 or R = 4 and D = 1, A = 48270.) 

[0083] The average signal level of the overture is around -27.20 dB; this piece of music is 
characterized by a very large dynamic range from below -70 dB to — 1 7 dB (see also Fig. 7). 
The segment signal level and the SNRs for the overture example are plotted in Fig.7. Each 
segment comprises 6000 samples and therefore corresponds to a time range of 0.136 s. In 
this case, spherical logarithmic quantization is applied in D = 3 dimensions with ^4 = 1014 
and A = 4858 and in D = 6 dimensions with A = 48270 and with the same compromise 
predictor of the order of P = 2 as in the first example. The top curve shows the large 
dynamics brought about by the known loud tones divided by pauses over a plurality of bars at 
the start and in the middle of this popular piece of music. The curves for A = 1014 and A = 
4858 disclose the advantage of an increasing A with respect to the SNR by segment, wherein 
the pauses should be noted in particular. The bottom curve in Fig. 7 demonstrates this aspect 
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for a further increase in A and the resulting further gain in SNR of 3 dB because of the higher 
dimension number D = 6 (see also Fig. 3). It should be noted that in spite of the low rate of R 
= 4 bits/sample, a 10 logio(SNR) >35dB (with respect to the original CD) is maintained even 
in the pauses. 

[0084] The value A = 48270 corresponds to a compression of 12 bits, e.g., for D = l,R = 
12 a resolution of 24 bits is achieved for small signal values. Of course, there is never a 
resolution corresponding to 16 bits for D = 1, R = 4 because the interval number is too low. 
However, when spherical logarithmic quantization is applied in 6 dimensions, the extremely 
large value of A = 48270 is actually well chosen for a high minimum value of the SNR. In 
this case, the resolution is even identical to that of the original CD data for segments with a 
signal level of -70 dB and lower. To date, no one has been found, even among trained 
personnel, who was capable of reliably hearing a difference between the quantized signal at R 
= 4 y D = 6,A = 48270 and the original signal. 

[0085] The preceding description shows a waveform-preserving digitization method for 
analog source signals which, on the one hand, combines the gain by multidimensional 
quantization with the advantages of logarithmic quantization and, on the other hand, is 
capable of additionally increasing the objective signal/noise ratio through the addition of 
prediction gains. In conclusion, it is noted that, in addition to a favorable exchange of rate 
and distortion, this method is distinguished above all by an extremely large dynamic range 
accompanied at the same time by an extremely low system-induced delay of the signal by 
only a few sample intervals. 

[0086] While the foregoing description and drawings represent the present invention, it 
will be obvious to those skilled in the art that various changes may be made therein without 
departing from the true spirit and scope of the present invention. 



ABSTRACT 



[0087] The invention is based on the idea of providing a method for high- 
resolution, waveform-preserving digitization of analog signals, wherein conventional 
scalar logarithmic quantization is transferred to multi-dimensional spherical 
coordinates, and the advantages resulting from this, e.g., a constant signal/noise 
ratio over an extremely high dynamic range with very low loss with respect to the 
rate-distortion theory. In order to make use of the statistical dependencies present in 
the source signal for an additional gain in the signal/noise ratio, the differential pulse 
code modulation (DPCM) is combined with spherical logarithmic quantization. The 
resulting method achieves an effective data reduction with a high long-term and 
short-term signal/noise ratio with an extremely small signal delay. 



